166 research outputs found

    A Coevolutionary Particle Swarm Algorithm for Bi-Level Variational Inequalities: Applications to Competition in Highway Transportation Networks

    Get PDF
    A climate of increasing deregulation in traditional highway transportation, where the private sector has an expanded role in the provision of traditional transportation services, provides a background for practical policy issues to be investigated. One of the key issues of interest, and the focus of this chapter, would be the equilibrium decision variables offered by participants in this market. By assuming that the private sector participants play a Nash game, the above problem can be described as a Bi-Level Variational Inequality (BLVI). Our problem differs from the classical Cournot-Nash game because each and every player’s actions is constrained by another variational inequality describing the equilibrium route choice of users on the network. In this chapter, we discuss this BLVI and suggest a heuristic coevolutionary particle swarm algorithm for its resolution. Our proposed algorithm is subsequently tested on example problems drawn from the literature. The numerical experiments suggest that the proposed algorithm is a viable solution method for this problem

    A textual-based featuring approach for depression detection using machine learning classifiers and social media texts

    Get PDF
    Depression is one of the leading causes of suicide worldwide. However, a large percentage of cases of depression go undiagnosed and, thus, untreated. Previous studies have found that messages posted by individuals with major depressive disorder on social media platforms can be analysed to predict if they are suffering, or likely to suffer, from depression. This study aims to determine whether machine learning could be effectively used to detect signs of depression in social media users by analysing their social media posts—especially when those messages do not explicitly contain specific keywords such as ‘depression’ or ‘diagnosis’. To this end, we investigate several text preprocessing and textual-based featuring methods along with machine learning classifiers, including single and ensemble models, to propose a generalised approach for depression detection using social media texts. We first use two public, labelled Twitter datasets to train and test the machine learning models, and then another three non-Twitter depression-class only datasets (sourced from Facebook, Reddit, and an electronic diary) to test the performance of our trained models in other social media sources. Experimental results indicate that the proposed approach is able to effectively detect depression via social media texts even when the training datasets do not contain specific keywords (such as ‘depression’ and ‘diagnose’), as well as when unrelated datasets are used for testing

    A Multi-type Classifier Ensemble for Detecting Fake Reviews Through Textualbased Feature Extraction

    Get PDF
    The financial impact of online reviews has prompted some fraudulent sellers to generate fake consumer reviews for either promoting their products or discrediting competing products. In this study, we propose a novel ensemble model - the Multitype Classifier Ensemble (MtCE) - combined with a textual-based featuring method, which is relatively independent of the system, to detect fake online consumer reviews. Unlike other ensemble models that utilise only the same type of single classifier, our proposed ensemble utilises several customised machine learning classifiers (including deep learning models) as its base classifiers. The results of our experiments show that the MtCE can adequately detect fake reviews, and that it outperforms other single and ensemble methods in terms of accuracy and other measurements in all the relevant public datasets used in this study. Moreover, if set correctly, the parameters of MtCE, such as base-classifier types, the total number of base classifiers, bootstrap and the method to vote on output (e.g., majority or priority), further improve the performance of the proposed ensemble

    Incorporating feature ranking and evolutionary methods for the classification of high-dimensional DNA microarray gene expression data

    Get PDF
    Background: DNA microarray gene expression classification poses a challenging task to the machine learning domain. Typically, the dimensionality of gene expression data sets could go from several thousands to over 10,000 genes. A potential solution to this issue is using feature selection to reduce the dimensionality. Aim The aim of this paper is to investigate how we can use feature quality information to improve the precision of microarray gene expression classification tasks. Method: We propose two evolutionary machine learning models based on the eXtended Classifier System (XCS) and a typical feature selection methodology. The first one, which we call FS-XCS, uses feature selection for feature reduction purposes. The second model is GRD-XCS, which uses feature ranking to bias the rule discovery process of XCS. Results: The results indicate that the use of feature selection/ranking methods is essential for tackling high-dimensional classification tasks, such as microarray gene expression classification. However, the results also suggest that using feature ranking to bias the rule discovery process performs significantly better than using the feature reduction method. In other words, using feature quality information to develop a smarter learning procedure is more efficient than reducing the feature set. Conclusion: Our findings have shown that extracting feature quality information can assist the learning process and improve classification accuracy. On the other hand, relying exclusively on the feature quality information might potentially decrease the classification performance (e.g., using feature reduction). Therefore, we recommend a hybrid approach that uses feature quality information to direct the learning process by highlighting the more informative features, but at the same time not restricting the learning process to explore other features

    Cascaded channel estimation technique for massive MIMO relay system

    Get PDF
    © Published under licence by IOP Publishing Ltd. This paper concerns with the cascaded channel estimation of massive multiple-input multiple-output (MIMO) relay system. In order to meet the increasing demands for high-speed wireless communication networks, massive MIMO has been recognized as one of the key technologies for the future fifth generation (5G) cellular networks. It is an advanced MIMO technique consists of a very large number of antennas at the base station and serves a smaller number of single-antenna users simultaneously. Basically, the idea of massive MIMO technique is to harvest all the advantages of conventional MIMO system in a much larger scale. To reap the benefits of massive MIMO in practice, an accurate estimation of the channel state information (CSI) is needed. In this paper, the relaying technique has been incorporated with massive MIMO system in order to increase system throughput and improve the coverage in cell-edge users. A relay node is placed in between transmitter and receiver to reduce the path loss and improve the spectral efficiency of massive MIMO system. Cascaded channel estimation technique for massive MIMO relay system is developed in this paper. The mean squared error (MSE) of the cascaded channel estimation for massive MIMO relay sytem is optimized to obtain accurate CSI

    Identifying the high-value social audience from Twitter through text-mining methods

    Get PDF
    Doing business on social media has become a common practice for many companies these days. While the contents shared on Twitter and Facebook offer plenty of opportunities to uncover business insights, it remains a challenge to sift through the huge amount of social media data and identify the potential social audience who is highly likely to be interested in a particular company. In this paper, we analyze the Twitter content of an account owner and its list of followers through various text mining methods, which include fuzzy keyword matching, statistical topic modeling and machine learning approaches. We use tweets of the account owner to segment the followers and identify a group of high-value social audience members. This enables the account owner to spend resources more effectively by sending offers to the right audience and hence maximize marketing efficiency and improve the return of investment

    Using support vector machine ensembles for target audience classification on Twitter

    Get PDF
    The vast amount and diversity of the content shared on social media can pose a challenge for any business wanting to use it to identify potential customers. In this paper, our aim is to investigate the use of both unsupervised and supervised learning methods for target audience classification on Twitter with minimal annotation efforts. Topic domains were automatically discovered from contents shared by followers of an account owner using Twitter Latent Dirichlet Allocation (LDA). A Support Vector Machine (SVM) ensemble was then trained using contents from different account owners of the various topic domains identified by Twitter LDA. Experimental results show that the methods presented are able to successfully identify a target audience with high accuracy. In addition, we show that using a statistical inference approach such as bootstrapping in over-sampling, instead of using random sampling, to construct training datasets can achieve a better classifier in an SVM ensemble. We conclude that such an ensemble system can take advantage of data diversity, which enables real-world applications for differentiating prospective customers from the general audience, leading to business advantage in the crowded social media space

    Vehicle recognition system using RFID technology for parking management system

    Get PDF
    © Published under licence by IOP Publishing Ltd. Technology is growing each day by leaps and bounds. The change in technology has done wonders to increase productivity in everyday life. However, drivers searching for parking contributed to traffic congestion especially in urban and suburban cities, while consuming their time and energy. Besides, traffic congestion in parking area is always an issue which contribute to traffic on the main road. The current approach to overcome this includes the implementation of parking ticket system, especially in busy area. The hour rate for busy area is elevated to promote liquidation of vehicles. However, this system requires patrol warden to manually inspect each vehicle for the parking ticket. The feasibility of the system decreases when the parking area is larger. RFID parking management system aims to bring automation into parking system. The use of passive UHF RFID which can read multiple tags from long distance can replace patrol warden and thus increase the effectiveness of the Vehicle Parking Management System (VPMS). Furthermore, the power consumption of RFID system is analyzed. A new method has been introduced to reduce the power consumption of RFID system which operates 24/7. Experimental results demonstrate the effectiveness of the proposed method in reducing the power consumption of the RFID system

    Effects of training datasets on both the extreme learning machine and support vector machine for target audience identification on twitter

    Get PDF
    The ability to identify or predict a target audience from the increasingly crowded social space will provide a company some competitive advantage over other companies. In this paper, we analyze various training datasets, which include Twitter contents of an account owner and its list of followers, using features generated in different ways for two machine learning approaches - the Extreme Learning Machine (ELM) and Support Vector Machine (SVM). Various configurations of the ELM and SVM have been evaluated. The results indicate that training datasets using features generated from the owner tweets achieve the best performance, relative to other feature sets. This finding is important and may aid researchers in developing a classifier that is capable of identifying a specific group of target audience members. This will assist the account owner to spend resources more effectively, by sending offers to the right audience, and hence maximize marketing efficiency and improve the return on investment
    • …
    corecore